253 research outputs found

    Reviving the Transcriptome Studies: An Insight Into the Emergence of Single-Molecule Transcriptome Sequencing

    Get PDF
    Advances in transcriptomics have provided an exceptional opportunity to study functional implications of the genetic variability. Technologies such as RNA-Seq have emerged as state-of-the-art techniques for transcriptome analysis that take advantage of high-throughput next-generation sequencing. However, similar to their predecessors, these approaches continue to impose major challenges on full-length transcript structure identification, primarily due to inherent limitations of read length. With the development of single-molecule sequencing (SMS) from PacBio, a growing number of studies on the transcriptome of different organisms have been reported. SMS has emerged as advantageous for comprehensive genome annotation including identification of novel genes/isoforms, long non-coding RNAs and fusion transcripts. This approach can be used across a broad spectrum of species to better interpret the coding information of the genome, and facilitate the biological function study. We provide an overview of SMS platform and its diverse applications in various biological studies, and our perspective on the challenges associated with the transcriptome studies

    Ranked Choice Voting for Representative Transcripts with TRaCE

    Get PDF
    Genome sequencing projects annotate protein-coding gene models with multiple transcripts, aiming to represent all of the available transcript evidence. However, downstream analyses often operate on only one representative transcript per gene locus, sometimes known as the canonical transcript. To choose canonical transcripts, TRaCE (Transcript Ranking and Canonical Election) holds an ‘election’ in which a set of RNA-seq samples rank transcripts by annotation edit distance. These sample-specific votes are tallied along with other criteria such as protein length and InterPro domain coverage. The winner is selected as the canonical transcript, but the election proceeds through multiple rounds of voting to order all the transcripts by relevance. Based on the set of expression data provided, TRaCE can identify the most common isoforms from a broad expression atlas or prioritize alternative transcripts expressed in specific contexts

    CMap 1.01: a comparative mapping application for the Internet

    Get PDF
    Summary:CMap is a web-based tool for displaying and comparing maps of any type and from any species. A user can compare an unlimited number of maps, view pair-wise comparisons of known correspondences, and search for maps or for features by name, species, type and accession. CMap is freely available, can run on a variety of database engines and uses only free and open software components

    Pervasive gene content variation and copy number variation in maize and its undomesticated progenitor

    Get PDF
    Individuals of the same species are generally thought to have very similar genomes. However, there is growing evidence that structural variation in the form of copy number variation (CNV) and presence-absence variation (PAV) can lead to variation in the genome content of individuals within a species. Array comparative genomic hybridization (CGH) was used to compare gene content and copy number variation among 19 diverse maize inbreds and 14 genotypes of the wild ancestor of maize, teosinte. We identified 479 genes exhibiting higher copy number in some genotypes (UpCNV) and 3410 genes that have either fewer copies or are missing in the genome of at least one genotype relative to B73 (DownCNV/PAV). Many of these DownCNV/PAV are examples of genes present in B73, but missing from other genotypes. Over 70% of the CNV/PAV examples are identified in multiple genotypes, and the majority of events are observed in both maize and teosinte, suggesting that these variants predate domestication and that there is not strong selection acting against them. Many of the genes affected by CNV/PAV are either maize specific (thus possible annotation artifacts) or members of large gene families, suggesting that the gene loss can be tolerated through buffering by redundant functions encoded elsewhere in the genome. While this structural variation may not result in major qualitative variation due to genetic buffering, it may significantly contribute to quantitative variation

    The Gramene Genetic Diversity Module: a resource for genotype-phenotype association analysis in grass species

    Get PDF
    The Genetic Diversity module of the Gramene database ("http://www.gramene.org/diversity":http://www.gramene.org/diversity) is specifically designed to handle data from high-throughput sequencing and array-based genotyping plateforms. Empowered by the Genomic Diversity and Phenotype Data Model, Gramene Genetic Diversity module provides live database connectivities of RFLP, SSR and SNP allele data, information about QTL, passport data for wild and cultivated germplasm from rice, maize, wheat, and _Arabidopsis_, and quantitative phenotypic data for some of these accessions. Large datasets of SNP variation are searchable via genomic positions of interest by SNP Query tool on a sequenced genome; and, trait associations, patterns of linkage disequilibrium and diversity can be evaluated using TASSEL. The Gramene database is updated twice a year, with the most recent release (Build #31) completed in May 2010

    Era of gapless plant genomes: innovations in sequencing and mapping technologies revolutionize genomics and breeding

    Get PDF
    Whole-genome sequencing and assembly have revolutionized plant genetics and molecular biology over the last two decades. However, significant shortcomings in first- and second-generation technology resulted in imperfect reference genomes: numerous and large gaps of low quality or undeterminable sequence in areas of highly repetitive DNA along with limited chromosomal phasing restricted the ability of researchers to characterize regulatory noncoding elements and genic regions that underwent recent duplication events. Recently, advances in long-read sequencing have resulted in the first gapless, telomere-to-telomere (T2T) assemblies of plant genomes. This leap forward has the potential to increase the speed and confidence of genomics and molecular experimentation while reducing costs for the research community

    Role of a ZF-HD Transcription Factor in miR157-Mediated Feed-Forward Regulatory Module That Determines Plant Architecture in Arabidopsis

    Get PDF
    In plants, vegetative and reproductive development are associated with agronomically important traits that contribute to grain yield and biomass. Zinc finger homeodomain (ZF-HD) transcription factors (TFs) constitute a relatively small gene family that has been studied in several model plants, including Arabidopsis thaliana L. and Oryza sativa L. The ZF-HD family members play important roles in plant growth and development, but their contribution to the regulation of plant architecture remains largely unknown due to their functional redundancy. To understand the gene regulatory network controlled by ZF-HD TFs, we analyzed multiple loss-of-function mutants of ZF-HD TFs in Arabidopsis that exhibited morphological abnormalities in branching and flowering architecture. We found that ZF-HD TFs, especially HB34, negatively regulate the expression of miR157 and positively regulate SQUAMOSA PROMOTER BINDING-LIKE 10 (SPL10), a target of miR157. Genome-wide chromatin immunoprecipitation sequencing (ChIP-Seq) analysis revealed that miR157D and SPL10 are direct targets of HB34, creating a feed-forward loop that constitutes a robust miRNA regulatory module. Network motif analysis contains overrepresented coherent type IV feedforward motifs in the amiR zf-HD and hbq mutant background. This finding indicates that miRNA-mediated ZF-HD feedforward modules modify branching and inflorescence architecture in Arabidopsis. Taken together, these findings reveal a guiding role of ZF-HD TFs in the regulatory network module and demonstrate its role in plant architecture in Arabidopsis

    Gramene: Development and Integration of Trait and Gene Ontologies for Rice

    Get PDF
    Gramene (http://www.gramene.org/) is a comparative genome database for cereal crops and a community resource for rice. We are populating and curating Gramene with annotated rice (Oryza sativa) genomic sequence data and associated biological information including molecular markers, mutants, phenotypes, polymorphisms and Quantitative Trait Loci (QTL). In order to support queries across various data sets as well as across external databases, Gramene will employ three related controlled vocabularies. The specific goal of Gramene is, first to provide a Trait Ontology (TO) that can be used across the cereal crops to facilitate phenotypic comparisons both within and between the genera. Second, a vocabulary for plant anatomy terms, the Plant Ontology (PO) will facilitate the curation of morphological and anatomical feature information with respect to expression, localization of genes and gene products and the affected plant parts in a phenotype. The TO and PO are both in the early stages of development in collaboration with the International Rice Research Institute, TAIR and MaizeDB as part of the Plant Ontology Consortium. Finally, as part of another consortium comprising macromolecular databases from other model organisms, the Gene Ontology Consortium, we are annotating the confirmed and predicted protein entries from rice using both electronic and manual curation
    • …
    corecore